Poster: MaureenKennedy Date: May 5, 2010 7:26pm
Forum: texts Subject: It's time for formal organization of IA's books

With the hundreds of pages of government text and Google uploads, I think it is time for IA to organize uploads into topics instead of simply by title and author. Many uploads (Google) have no subject in their header, so that search feature is useless. At page 200 of incoming uploads there were still government court documents clogging the system with no author names.

My favorite pastime used to be coming here each night and reading through the daily uploads, just browsing for some interesting books to download. Can't do that any more with all the junk being uploaded, which is specific in its interest. Even a library uses a subject classification (Dewey) before reverting to author or title.

What's the point in having millions of books when you can't peruse them anymore? And yes, I have used the "minus government text" search someone mentioned below. Doing so does not seem to give you the latest uploaded books.

The uploading contributors seem to do their inputs according to what section of their library they are scanning, i.e., sometimes there are pages and pages of foreign language books, which is good for foreign researchers but useless for the majority of US readers. Rather than individual library upload lists, perhaps subject lists like foreign language books, governmental text, science, etc. should be considered, with books uploaded into their category.

Poster: Time Traveller Date: May 5, 2010 10:32pm
Forum: texts Subject: Re: It's time for formal organization of IA's books

I get notified of all new texts via RSS feed. I just went through a 2,000-item backlog and found nothing worthwhile, due to all the government reports.

That stuff needs to be indexed into categories. Otherwise we will soon get to the point where searches bring up so many useless results that few people will use the IA, just trained researchers and historians.

Peter (2000 items = about 48 hours of uploads)

Poster: MaureenKennedy Date: May 5, 2010 11:07pm
Forum: texts Subject: Re: It's time for formal organization of IA's books

Just for your info: doing texts minus the government collection brings the book listing up starting from the first upload. Meaning if you want to see the most recent books, hit "last", which takes you to page 39 thousand or so, and work your way backwards for the newest listings! Heck of a way to browse.

Just for fun I ran federal government reports and found that there are 7,222 PAGES of about 53 listings a page, for a total of 382,766 files. I think they should store them on their own website!

This post was modified by MaureenKennedy on 2010-05-06 06:07:50

Poster: stbalbach Date: May 6, 2010 7:09am
Forum: texts Subject: Re: It's time for formal organization of IA's books

By default there is a link called "See recent additions" on the main texts page. This shows everything uploaded to Internet Archive sorted most recent first, including user uploads, government documents and foreign language texts. For casual browsing purposes, it may be expedient to filter some of this out. Below is an example of how it is done.

This link shows the most recent additions using the following search: [ mediatype:(texts) -collection:(usfederalcourts) -collection:(opensource) AND (language:eng OR language:English OR language:Eng) ]. In other words, it will exclude the government collection, exclude the opensource collection, which has a lot of spam, and will list only English-language works.
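
For anyone who wants to tweak the exclusions without retyping the whole thing, here is a rough Python sketch that assembles the same search string from its parts. The function name `build_query` and its parameters are made up for illustration; only the field names (`mediatype`, `collection`, `language`) and the collection names come from the search above.

```python
def build_query(exclude_collections, languages, mediatype="texts"):
    """Assemble a search string in the same shape as the one quoted above.

    build_query is a hypothetical helper, not part of any IA tool.
    """
    # Start with the media type, e.g. mediatype:(texts)
    parts = [f"mediatype:({mediatype})"]
    # A leading minus excludes a collection from the results
    parts += [f"-collection:({name})" for name in exclude_collections]
    # The metadata is not uniform, so list every spelling of the language
    if languages:
        lang_clause = " OR ".join(f"language:{lang}" for lang in languages)
        parts.append(f"AND ({lang_clause})")
    return " ".join(parts)


query = build_query(
    exclude_collections=["usfederalcourts", "opensource"],
    languages=["eng", "English", "Eng"],
)
print(query)
```

To keep the opensource collection in the results, drop it from `exclude_collections` and rebuild the string.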

You can fine tune the search to your taste, and then bookmark it and use your bookmarked link.

Remember: the "Sort results by: Date added" option must be clicked manually on the right side of the screen in order to sort the results by most recent first. Or, add "&sort=-publicdate" at the end of the URL. Sorting by date cannot be set via the search string, only by changing the URL or clicking the option. Once you have the URL set, bookmark it for future use.
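
If you would rather generate the bookmark than edit the URL by hand, something like the Python sketch below should work. It assumes the search page lives at `http://www.archive.org/search.php` and takes `query` and `sort` parameters, which is the shape of a search link posted elsewhere in this thread; the function name `search_url` is hypothetical.

```python
from urllib.parse import urlencode

# Assumed base address, taken from a search link posted in this thread
BASE = "http://www.archive.org/search.php"


def search_url(query, sort="-publicdate"):
    """Return a bookmarkable URL with the sort order appended.

    search_url is a hypothetical helper; urlencode escapes the spaces,
    colons and parentheses in the search string.
    """
    return BASE + "?" + urlencode({"query": query, "sort": sort})


url = search_url("mediatype:(texts) -collection:(usfederalcourts)")
print(url)
```

The leading minus in `-publicdate` is what puts the most recent additions first, same as adding it to the URL by hand.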

Stephen

Poster: martyveldman Date: May 6, 2010 7:50am
Forum: texts Subject: Re: It's time for formal organization of IA's books

In the Sub-Collections, you can also click on them to see the latest additions, and they are somewhat categorized. The Opensource books have some very interesting items, and I haven't seen any spam lately. Thanks for the link, Stephen. I can't seem to figure out how the advanced search works, even with instructions! How about a link called 'IA for Dummies'!

Poster: stbalbach Date: May 6, 2010 8:06am
Forum: texts Subject: Re: It's time for formal organization of IA's books

I wrote an informal FAQ that may be of some help:
http://docs.google.com/Doc?id=dc2wxqtd_1cs2hzvnh

If you have any questions on searching, let me know. The IA system is really pretty complex because the metadata is not uniform. Searching for "John Jacob Astor" takes an entirely different strategy from searching for "Sir Winston Churchill". I *could* write a generic search string that took into account all possibilities so one doesn't have to think about it, but the search field has a length limit, making that impossible. So one has to build complex custom searches each time.

I think they are trying to bring some order to it with Open Library. A "For Dummies" guide would be perfect :)

Stephen

Poster: Time Traveller Date: May 6, 2010 11:04pm
Forum: texts Subject: Re: It's time for formal organization of IA's books

BTW, wouldn't a link called "IA for Dummies" be a breach of a brand name?

Hey Gerry, please comment.

Peter

(I wonder if my name has been trademarked)

Poster: Time Traveller Date: May 6, 2010 10:54pm
Forum: texts Subject: Re: It's time for formal organization of IA's books

Yes, they should store the stuff themselves, but hey, there is a recession, and the IA is run on charity.

And on the video side of the Internet Archive, somebody, or NASA, is uploading all the videos that you can also download from the NASA servers.

NASA will have proper archiving, so I wonder if it's just a sad person with nothing better to do than move videos from NASA to the IA, using up bandwidth and storage the IA pays for.

If such stuff has to be archived or catalogued by the IA, why not just post links back to the government servers, like NASA's?

Peter.

Poster: martyveldman Date: May 6, 2010 7:22am
Forum: texts Subject: Re: It's time for formal organization of IA's books

Peter, I agree. There should be language categories as well as BROWSE BOOKS BY SUBJECT, as on the Princeton Seminary site. We should also be able to browse the latest uploads the same way. Government reports belong on government sites.
Marty

Poster: garthus Date: May 6, 2010 11:49am
Forum: texts Subject: Re: It's time for formal organization of IA's books

Maureen,

I would not get too upset over this ... the prime directive is to upload material. All of my uploads have good metadata for the subject field:

http://www.archive.org/search.php?query=garthus&sort=-publicdate

I kept bugging them about what you said; I have already volunteered some of my time to do this, but I get no response from the appropriate people. While I do not think that this is a catastrophe ... it would be good to start doing it now before the collections get even bigger ... also, I do not think that government reports are junk; they are just information like everything else. Look at some of the bluegrass music, for example; some of the materials being posted are very nice. My argument is that they are not making full use of the talents and skills of the volunteers out there; but what you talk about can always be done later; getting books up should still be the prime directive.

Gerry

Poster: Time Traveller Date: May 6, 2010 10:21pm
Forum: texts Subject: Re: It's time for formal organization of IA's books

As long as the people at the IA keep closed ranks, they make themselves the Elite by default.

Peter